Efficient Non-Oblivious Randomized Reduction for Risk Minimization with Improved Excess Risk Guarantee
نویسندگان
چکیده
In this paper, we address learning problems for high dimensional data. Previously, oblivious random projection based approaches that project high dimensional features onto a random subspace have been used in practice for tackling highdimensionality challenge in machine learning. Recently, various non-oblivious randomized reduction methods have been developed and deployed for solving many numerical problems such as matrix product approximation, low-rank matrix approximation, etc. However, they are less explored for the machine learning tasks, e.g., classification. More seriously, the theoretical analysis of excess risk bounds for risk minimization, an important measure of generalization performance, has not been established for non-oblivious randomized reduction methods. It therefore remains an open problem what is the benefit of using them over previous oblivious random projection based approaches. To tackle these challenges, we propose an algorithmic framework for employing non-oblivious randomized reduction method for general empirical risk minimizing in machine learning tasks, where the original high-dimensional features are projected onto a random subspace that is derived from the data with a small matrix approximation error. We then derive the first excess risk bound for the proposed non-oblivious randomized reduction approach without requiring strong assumptions on the training data. The established excess risk bound exhibits that the proposed approach provides much better generalization performance and it also sheds more insights about different randomized reduction approaches. Finally, we conduct extensive experiments on both synthetic and real-world benchmark datasets, whose dimension scales to O(10), to demonstrate the efficacy of our proposed approach.
منابع مشابه
The effect of Citrus Aurantifolia (Lemon) peels on cardiometabolic risk factors and markers of endothelial function in adolescents with excess weight: A triple-masked randomized controlled trial
Background: Childhood obesity is becoming a global problem and its incidence is increasing. The role of dietary intervention with fruits containing vitamin C and flavonoid to control obesity consequences in childhood has not been yet defined. Lemon (Citrus aurantifolia) peels contain flavonoid, pectin and vitamin C. We aimed to compare the effects of lemon peels and placebo on cardiometabolic r...
متن کاملEfficient Private Empirical Risk Minimization for High-dimensional Learning
Dimensionality reduction is a popular approach for dealing with high dimensional data that leads to substantial computational savings. Random projections are a simple and effective method for universal dimensionality reduction with rigorous theoretical guarantees. In this paper, we theoretically study the problem of differentially private empirical risk minimization in the projected subspace (c...
متن کاملBISTRO: An Efficient Relaxation-Based Method for Contextual Bandits
We present efficient algorithms for the problem of contextual bandits with i.i.d. covariates, an arbitrary sequence of rewards, and an arbitrary class of policies. Our algorithm BISTRO requires d calls to the empirical risk minimization (ERM) oracle per round, where d is the number of actions. The method uses unlabeled data to make the problem computationally simple. When the ERM problem itself...
متن کاملEstimation of Secondary Skin Cancer Risk Due To Electron Contamination in 18-MV LINAC-Based Prostate Radiotherapy
Introduction Accurate estimation of the skin-absorbed dose in external radiation therapy is essential to estimating the probability of secondary carcinogenesis induction Materials and Methods Electron contamination in prostate radiotherapy was investigated using the Monte Carlo (MC) code calculation. In addition, field size dependence of the skin dose was assessed. Excess cancer risk induced by...
متن کاملEmpirical Risk Minimization: Probabilistic Complexity and Stepsize Strategy
Empirical risk minimization (ERM) is recognized as a special form in standard convex optimization. When using a first order method, the Lipschitz constant of the empirical risk plays a crucial role in the convergence analysis and stepsize strategies for these problems. We derive the probabilistic bounds for such Lipschitz constants using random matrix theory. We show that, on average, the Lipsc...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2017